Acoustic event recognition using dominant spectral basis vectors
نویسندگان
چکیده
This paper proposes a novel filter bank composed of dominant Spectral Basis Vectors (SBVs) in a spectrogram. Spectral envelopes represented by the SBVs have shown to be excellent characteristic features for discriminating different acoustic events in noisy environment. Non-negative Matrix Factorization (NMF) and non-negative K-SVD (NKSVD) for part-based and holistic representations extract dominant SBVs from a spectrogram. The effectiveness of the proposed method is demonstrated on a database of real life recordings via experiments, and its robust performance is compared to conventional methods.
منابع مشابه
Combining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)
Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...
متن کاملHeterogeneous acoustic measurements and multiple classifiers for speech recognition
The acoustic-phonetic modeling component of most current speech recognition systems calculates a small set of homogeneous frame-based measurements at a single, fixed time-frequency resolution. This thesis presents evidence indicating that recognition performance can be significantly improved through a contrasting approach using more detailed and more diverse acoustic measurements, which we refe...
متن کاملPersian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملMinimum mean square error spectral peak envelope estimation for automatic vowel classification
Spectral feature computations continue to be a very difficult problem for accurate machine recognition of vowels especially in the presence of noise or for otherwise degraded acoustic signals. In this work, a new peak envelope method for vowel classification is developed, based on a missing frequency components model of speech recognition. According to this model, vowel recognition depends only...
متن کاملA Robust Environmental Sound Recognition System using BPNN and RBFNN
Abstract— In a reverberant environment, the performance of acoustic event recognition system can be bolstered by choosing appropriate feature descriptors and classifier techniques. Neural networks are by far providing stunning classification results when compared to other classifiers. This paper analyses two different neural networks and their precision when they both stumble upon same targets ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015